Name | Version | Summary | Date |
swebench | 3.0.5 | The official SWE-bench package - a benchmark for evaluating LMs on software engineering | 2025-02-01 22:33:00 |
mteb | 1.31.8 | Massive Text Embedding Benchmark | 2025-02-01 16:03:30 |
pytest-codspeed | 3.2.0 | Pytest plugin to create CodSpeed benchmarks | 2025-01-31 14:28:26 |
airflow-parse-bench | 1.0.1 | Easily measure and compare your Airflow DAGs' parse time. | 2025-01-26 03:39:23 |
mrna-bench | 1.0.1 | Benchmarking suite for mRNA property prediction. | 2025-01-23 23:15:44 |
opencompass | 0.4.0 | A comprehensive toolkit for large model evaluation | 2025-01-22 06:42:16 |
folktexts | 0.0.27 | Use LLMs to get classification risk scores on tabular tasks. | 2025-01-17 16:27:47 |
fusion-bench | 0.2.9 | A Comprehensive Benchmark of Deep Model Fusion | 2025-01-17 06:57:43 |
pydftracer | 1.0.8 | I/O profiler for deep learning python apps. Specifically for dlio_benchmark. | 2024-12-17 03:37:11 |
rdt | 1.13.2 | Reversible Data Transforms | 2024-12-16 22:46:10 |
qpbenchmark | 2.4.0 | Benchmark for quadratic programming solvers available in Python. | 2024-12-16 09:24:00 |
ms-opencompass | 0.1.5 | A lightweight toolkit for evaluating LLMs based on OpenCompass. | 2024-12-16 08:05:22 |
mlrb-agent-tasks | 0.0.23 | A task package for ML Research Bench | 2024-12-10 16:21:44 |
EpiLog | 1.1.2 | Simple No-Frills Logging Manager | 2024-12-06 21:32:56 |
nodespecs | 0.1.1 | The specs summarize utilities for computer instance | 2024-12-06 15:43:45 |
Younger | 0.0.1a2 | A Younger Project for Artificial Intelligence: Datasets, Benchmarks, and Applications. | 2024-11-25 08:01:45 |
syntherela | 0.0.3 | SyntheRela - Synthetic Relational Data Generation Benchmark | 2024-11-21 06:20:37 |
cmdbench | 0.1.22 | Quick and easy benchmarking for any command's CPU, memory, disk usage and runtime. | 2024-11-20 06:53:26 |
construe | 0.2.0 | An LLM inferencing benchmark tool focusing on device-specific latency and memory usage | 2024-11-13 03:52:24 |
guardbench | 1.0.0 | GuardBench: A Large-Scale Benchmark for Guardrail Models | 2024-11-12 02:44:56 |